49 research outputs found

Multiple testing correction in linear mixed models.

Author: Eskin Eleazar
Han Buhm
Hormozdiari Farhad
Joo Jong Wha J
Publication venue: eScholarship, University of California
Publication date: 01/04/2016

Background: Multiple hypothesis testing is a major issue in genome-wide association studies (GWAS), which often analyze millions of markers. The permutation test is considered the gold standard in multiple testing correction because it accurately accounts for the correlation structure of the genome. Recently, the linear mixed model (LMM) has become standard practice in GWAS, addressing issues of population structure and insufficient power. However, none of the current multiple testing approaches are applicable to LMM. Results: We were able to estimate per-marker thresholds as accurately as the gold standard approach in real and simulated datasets, while reducing the time required from months to hours. We applied our approach to mouse, yeast, and human datasets to demonstrate its accuracy and efficiency. Conclusions: We provide an efficient and accurate multiple testing correction approach for linear mixed models. We further provide an intuition about the relationships between per-marker threshold, genetic relatedness, and heritability, based on our observations in real data.
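The permutation test that the abstract calls the gold standard can be illustrated with a minimal sketch. This is not the LMM-based method of the paper. It is the plain max-statistic permutation procedure, assuming independent individuals and using absolute Pearson correlation as the per-marker statistic. The function names (`correlation`, `permutation_threshold`) and the defaults (`n_perm`, `alpha`, `seed`) are hypothetical choices made for illustration.

```python
import random


def correlation(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def permutation_threshold(genotypes, phenotype, n_perm=200, alpha=0.05, seed=0):
    """Estimate a study-wide significance threshold on |correlation|.

    genotypes: list of markers, each a list of per-individual genotype values.
    phenotype: list of per-individual trait values.
    Assumption: individuals are exchangeable (no population structure),
    which is exactly what does NOT hold in the LMM setting of the paper.
    """
    rng = random.Random(seed)
    shuffled = list(phenotype)  # copy, so the caller's data is left untouched
    null_max = []
    for _ in range(n_perm):
        # Shuffling the phenotype breaks any genotype-trait association
        # while keeping the correlation between markers intact.
        rng.shuffle(shuffled)
        null_max.append(max(abs(correlation(m, shuffled)) for m in genotypes))
    null_max.sort()
    # Take the (1 - alpha) empirical quantile of the per-permutation maxima.
    index = min(n_perm - 1, int((1 - alpha) * n_perm))
    return null_max[index]
```

A marker whose observed absolute correlation exceeds the returned value would be called significant at family-wise level `alpha` under these assumptions. The cost of repeating the scan over all markers for every permutation is what makes this approach slow at genome scale.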

SNU Open Repository and Archive

PubMed Central

eScholarship - University of California

Privacy preserving protocol for detecting genetic relatives using rare variants.

Author: Eskin Eleazar
Guan Feng
Hormozdiari Farhad
Joo Jong Wha J
Ostrosky Rafail
Sahai Amit
Wadia Akshay
Publication venue: eScholarship, University of California
Publication date: 01/06/2014

Motivation: High-throughput sequencing technologies have impacted many areas of genetic research. One such area is the identification of relatives from genetic data. The standard approach for the identification of genetic relatives collects the genomic data of all individuals and stores it in a database. Then, each pair of individuals is compared to detect the set of genetic relatives, and the matched individuals are informed. The main drawback of this approach is the requirement to share one's genetic data with a trusted third party that performs the relatedness test. Results: In this work, we propose a secure protocol to detect genetic relatives from sequencing data without exposing any information about their genomes. We assume that individuals have access to their genome sequences but do not want to share their genomes with anyone else. Unlike previous approaches, our approach uses both common and rare variants, which provides the ability to detect much more distant relationships securely. Using simulated data generated from the 1000 Genomes data, we illustrate that we can easily detect up to fifth-degree cousins, which was not possible using the existing methods. We also show that our method can detect individuals with cryptic relationships in the 1000 Genomes data. Availability: The software is freely available for download at http://genetics.cs.ucla.edu/crypto/

PubMed Central

eScholarship - University of California

Effectively identifying regulatory hotspots while capturing expression heterogeneity in gene expression studies

Author: Eskin Eleazar
Han Buhm
Joo Jong Wha J
Sul Jae Hoon
Ye Chun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

Expression quantitative trait loci (eQTL) mapping is a tool that can systematically identify genetic variation affecting gene expression. eQTL mapping studies have shown that certain genomic locations, referred to as regulatory hotspots, may affect the expression levels of many genes. Recently, studies have shown that various confounding factors may induce spurious regulatory hotspots. Here, we introduce a novel statistical method that effectively eliminates spurious hotspots while retaining genuine hotspots. Applying our method to simulated and real datasets, we validate that it achieves greater sensitivity while retaining low false discovery rates compared to previous methods.

Crossref

SNU Open Repository and Archive

Harvard University - DASH

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Meta-Analysis Identifies Gene-by-Environment Interactions as Demonstrated in a Study of 4,965 Mice

Author: Davis Richard C.
Eskin Eleazar
Furlotte Nicholas
Han Buhm
Joo Jong Wha J.
Kang Eun Yong
Lusis Aldons J.
Shih Diana
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014

Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional univariate or multivariate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in high-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions.
An additional advantage of our meta-analysis approach is that our combined study has significantly higher power and improved resolution compared to any single study, thus explaining the large number of loci discovered in the combined study.
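The idea that interactions show up as heterogeneity between studies can be made concrete with a small sketch. It computes the standard inverse-variance pooled effect, Cochran's Q, and the I² statistic for one locus across several studies. This is a generic textbook heterogeneity calculation, not the authors' random effects method. The function name `cochran_q` and its inputs (per-study effect estimates and standard errors) are assumptions made for illustration.

```python
def cochran_q(effects, std_errors):
    """Return (pooled_effect, Q, I2) for per-study effect estimates at one locus.

    effects:    per-study effect size estimates (e.g. regression coefficients).
    std_errors: per-study standard errors of those estimates.
    A large Q relative to its degrees of freedom (number of studies - 1)
    suggests the effect differs between studies, i.e. between environments.
    """
    # Inverse-variance weights: more precise studies count more.
    weights = [1.0 / se ** 2 for se in std_errors]
    pooled = sum(w * b for w, b in zip(weights, effects)) / sum(weights)
    # Q is the weighted sum of squared deviations from the pooled effect.
    q = sum(w * (b - pooled) ** 2 for w, b in zip(weights, effects))
    df = len(effects) - 1
    # I2 is the share of total variation attributed to heterogeneity, floored at 0.
    i2 = max(0.0, (q - df) / q) if q > 0 else 0.0
    return pooled, q, i2


# Hypothetical inputs: two studies with opposite-leaning estimates.
pooled, q, i2 = cochran_q([0.0, 2.0], [1.0, 1.0])
```

Under the null hypothesis of no heterogeneity, Q is approximately chi-square distributed with one fewer degree of freedom than the number of studies, so a p-value can be obtained from any chi-square survival function.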

SNU Open Repository and Archive

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

FigShare

Multiple testing correction in linear mixed models

Author: Buhm Han
Eleazar Eskin
Farhad Hormozdiari
Jong Wha J. Joo
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Efficient and Accurate Multiple-Phenotype Regression Method for High Dimensional Data Considering Population Structure

Author: Joo Jong Wha J
Publication date: 05/05/2023

Ezid

Recommended from our members

Effectively identifying regulatory hotspots while capturing expression heterogeneity in gene expression studies.

Author: Eskin Eleazar
Han Buhm
Joo Jong Wha J
Sul Jae Hoon
Ye Chun
Publication venue: eScholarship, University of California
Publication date: 01/04/2014

eScholarship - University of California

Efficient and Accurate Multiple-Phenotype Regression Method for High Dimensional Data Considering Population Structure

Author: Eskin Eleazar
Furlotte Nick
Hormozdiari Farhad
Joo Jong Wha J
Kang Eun Yong
Lusis Aldons J
Org Elin
Parks Brian
Publication venue: eScholarship, University of California
Publication date: 01/12/2016

A typical genome-wide association study tests correlation between a single phenotype and each genotype one at a time. However, single-phenotype analysis might miss unmeasured aspects of complex biological networks. Analyzing many phenotypes simultaneously may increase the power to capture these unmeasured aspects and detect more variants. Several multivariate approaches aim to detect variants related to more than one phenotype, but these current approaches do not consider the effects of population structure. As a result, these approaches may produce a significant number of false positive identifications. Here, we introduce a new methodology, referred to as GAMMA for generalized analysis of molecular variance for mixed-model analysis, which is capable of simultaneously analyzing many phenotypes and correcting for population structure. In a simulated study using data implanted with true genetic effects, GAMMA accurately identifies these true effects without producing false positives induced by population structure. In simulations with these data, GAMMA is an improvement over other methods, which either fail to detect true effects or produce many false positive identifications. We further apply our method to genetic studies of yeast and gut microbiome from mice and show that GAMMA identifies several variants that are likely to have true biological mechanisms.

eScholarship - University of California

Meta-analysis identifies gene-by-environment interactions as demonstrated in a study of 4,965 mice.

Author: Davis Richard C
Eskin Eleazar
Furlotte Nicholas
Han Buhm
Joo Jong Wha J
Kang Eun Yong
Lusis Aldons J
Shih Diana
Publication venue: eScholarship, University of California
Publication date: 01/01/2014

eScholarship - University of California